144 research outputs found

    Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice

    Full text link
    We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective function, and consists of approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. One of the keys to achieve acceleration in theory and in practice is to solve these sub-problems with appropriate accuracy by using the right stopping criterion and the right warm-start strategy. We give practical guidelines to use Catalyst and present a comprehensive analysis of its global complexity. We show that Catalyst applies to a large class of algorithms, including gradient descent, block coordinate descent, incremental algorithms such as SAG, SAGA, SDCA, SVRG, MISO/Finito, and their proximal variants. For all of these methods, we establish faster rates using the Catalyst acceleration, for strongly convex and non-strongly convex objectives. We conclude with extensive experiments showing that acceleration is useful in practice, especially for ill-conditioned problems.Comment: link to publisher website: http://jmlr.org/papers/volume18/17-748/17-748.pd

    Catalyst Acceleration for Gradient-Based Non-Convex Optimization

    Get PDF
    We introduce a generic scheme to solve nonconvex optimization problems using gradient-based algorithms originally designed for minimizing convex functions. Even though these methods may originally require convexity to operate, the proposed approach allows one to use them on weakly convex objectives, which covers a large class of non-convex functions typically appearing in machine learning and signal processing. In general, the scheme is guaranteed to produce a stationary point with a worst-case efficiency typical of first-order methods, and when the objective turns out to be convex, it automatically accelerates in the sense of Nesterov and achieves near-optimal convergence rate in function values. These properties are achieved without assuming any knowledge about the convexity of the objective, by automatically adapting to the unknown weak convexity constant. We conclude the paper by showing promising experimental results obtained by applying our approach to incremental algorithms such as SVRG and SAGA for sparse matrix factorization and for learning neural networks

    A Universal Catalyst for First-Order Optimization

    No full text
    main paper (9 pages) + appendix (21 pages)International audienceWe introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of theaccelerated proximal point algorithm. Our approach consists of minimizing a convex objective by approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. This strategy applies to a large class of algorithms, including gradient descent, block coordinate descent, SAG, SAGA, SDCA, SVRG, Finito/MISO, and their proximal variants. For all of these methods, we provide acceleration and explicit support for non-strongly convex objectives. In addition to theoretical speed-up, we also show that acceleration is useful in practice, especially for ill conditioned problems where we measure significant improvements

    FRIOD: a deeply integrated feature-rich interactive system for effective and efficient outlier detection

    Get PDF
    In this paper, we propose an novel interactive outlier detection system called feature-rich interactive outlier detection (FRIOD), which features a deep integration of human interaction to improve detection performance and greatly streamline the detection process. A user-friendly interactive mechanism is developed to allow easy and intuitive user interaction in all the major stages of the underlying outlier detection algorithm which includes dense cell selection, location-aware distance thresholding, and final top outlier validation. By doing so, we can mitigate the major difficulty of the competitive outlier detection methods in specifying the key parameter values, such as the density and distance thresholds. An innovative optimization approach is also proposed to optimize the grid-based space partitioning, which is a critical step of FRIOD. Such optimization fully considers the high-quality outliers it detects with the aid of human interaction. The experimental evaluation demonstrates that FRIOD can improve the quality of the detected outliers and make the detection process more intuitive, effective, and efficient

    Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity

    Full text link
    We study oracle complexity of gradient based methods for stochastic approximation problems. Though in many settings optimal algorithms and tight lower bounds are known for such problems, these optimal algorithms do not achieve the best performance when used in practice. We address this theory-practice gap by focusing on instance-dependent complexity instead of worst case complexity. In particular, we first summarize known instance-dependent complexity results and categorize them into three levels. We identify the domination relation between different levels and propose a fourth instance-dependent bound that dominates existing ones. We then provide a sufficient condition according to which an adaptive algorithm with moment estimation can achieve the proposed bound without knowledge of noise levels. Our proposed algorithm and its analysis provide a theoretical justification for the success of moment estimation as it achieves improved instance complexity

    Nonlinearity of Subtidal Estuarine Circulation in the Pearl River Estuary, China

    Get PDF
    The Pearl River Estuary (PRE) is a bell-shaped estuary with a narrow deep channel and wide shoals. This unique topographic feature leads to different dynamics of the subtidal estuarine circulation (SEC) in the PRE compared with a narrow and straight estuary. In this study, the nonlinear dynamics of the SEC in the PRE under mean circumstance are analyzed by using a validated 3D numerical model. Model results show that the nonlinear advections reach leading order in the along-channel momentum balance. Modulated by tide, the nonlinear advections show significant temporal variations as they have much larger values during spring tide than that during neap tide. Unlike straight and narrow estuaries, both tidally and cross-sectionally averaged axial and lateral advections play important roles in driving the SEC in the PRE in which the axial advection dominates the nonlinear effect, but the two nonlinear terms balance each other largely resulting in a reduced nonlinear effect. Despite this, the total nonlinear advection is still comparable with other terms, and it acts as the baroclinic pressure to reinforce the SEC, especially during the ebb tide, suggesting a flood–ebb asymmetry of the nonlinear dynamics in the PRE. In addition, diagnostic analyses of the along-channel vorticity budget show that nonlinear advections also make significant contribution to drive the lateral circulation in the PRE as Coriolis and baroclinic pressure terms, indicating complex dynamics of the circulation in the PRE
    • …
    corecore